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Abstract 

The paper considers a linear regression model in high-dimension with multiple change-points 
and heavy-tailed errors. Least squares method with LASSO or adaptive LASSO penalty can not 
be used since the theoretical assumptions do not occur and the estimators are not robust. Then, 
the quantile model with SCAD or LASSO-type penalty allows in the same time to estimate the 
parameters on every segment and eliminate the irrelevant variables. The oracle properties are 
not affect by the change-point estimation. Monte-Carlo simulations shown the performance of 
the methods. 
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1 Introduction 

In recent years, the change-point models and high-dimension regression have received much at- 
tention in the literature, most often in the case of a model with zero mean errors and bounded 
variance. A L\ or adaptive L\ penalty in the context of least squares model can be considered. 
We obtain then the popular method introduced by Tibshirani(1996) and called LASSO (Least Ab- 
solute Shrinkage and Selection Operator) method. It is well known than the outliers may cause 
a large error in a least squares estimator. This can happen especially when the error distribution 
is not Gaussian and distribution tail is large enough. An alternative method is then the quantile 
estimation. 

Study in this paper was motived by wishes to find the properties of the two estimators, particu- 
larly interesting in a change-point model with heavy-tailed errors, the outliers being able to create 
problems in the detection of the jumps. Hence the ideas to first consider a quantile regression. To 
be more precisely, if the errors [Ei)\<i< n of the regression model are such that JP[e.i < 0] = r, then 
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the rth quantile regression is considered, i.e. the regression parameters are found by minimizing 
the function p T (e) = Ya=i £ i[ T ^ei>o — (1 — T )^Ei<o\- The choice of r = 1/2 yields the median 
regression and the Li-estimator, also known as least absolute deviation (LAD) estimator. 
Moreover, when the model has a very large regressor variable number a penalty is necessary to 
estimate simultaneously the parameters on every segment and to eliminate the irrelevant regres- 
sors without crossing every time by a hypothesis test. The SCAD (Smoothly Clipped Absolute 
Deviation) and LASSO penalties have the advantage of selection and parameter estimation. It was 
established that these two methods have the oracle properties in a model without change-points: 
the zero components of the true parameters are estimated (shrunk) as with probability tending to 

1 (also called sparsity property) and the nonzero components have an optimal estimation rate (and 
is asymptotically normal). See Wu and Liu (2009) for the SCAD method rth quantile regression 
and Xu and Ying (2010) for the LASSO-type method in a median regression, both models without 
change-points. Recall also for a median regression in high dimension the paper of Wang(2012), 
where a L\ penalized least absolute deviation method is considered, when the overall variable num- 
ber is larger than the observation number. 

In a multiple change-point model, the break estimation could affects the estimator properties. This 
is the main interest of this paper. The difficulty to study a change-point model results first from the 
dependence of the model of two parameter type: the regression parameters and the change-points. 
A change-point linear model in high-dimension was also considered by Ciuperca(2013) but under 
stronger assumptions that the errors have mean zero and bounded variance. An adaptive LASSO 
estimator was studied. It was proved that it has the oracle properties on each estimated segment. 
However, when the model contains outliers, the adaptive LASSO estimator not be may robust and 
moreover, the observation number should be greater than the parameter number to be estimated. 
In the present work we restrict our attention to the quantile regression in high-dimension with 
multiple change-points when the classical conditions on the errors do not occur. We also carry out 
simulations to investigate the properties of the two proposed estimators. 

The paper is organized as follows. The model and assumptions are introduced in Section 2. In 
Section 3, the SCAD estimator in a change-point model is proposed and its asymptotic behavior is 
studied. Next, LASSO-type estimator is given in Section 4. For both methods, the oracle proper- 
ties, convergence rate and asymptotic distribution of the estimators are obtained. Section 5 reports 
some simulations results which illustrate the methods interest. In Section 6 we give the proofs of 
Theorems. Finally, Section 7 contains some lemmas which are useful to prove the main results. 

2 Model and general notations 

In this section we introduce the models without and with change-points, general assumptions, 

notations and also general results. 

We consider the linear model without change-points 



Yi = X?0 + Si 



i = 1, • • • , n 
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where the response variable Y{ is an univariate random variable, Xj G MP is a p- vector of regressors 
(covariates) and the is the error. The errors (£i)i<i< n are independent identically distributed 
(i.i.d) random variables. The regression parameters are </> E T C M p , with T a compact set and 
4>° true value (unknown) of the parameter <f>. Contrary to the classic suppositions for a regression 
model, we do not impose the condition that the mean of errors £j is zero or that their variance is 
bounded. 

All throughout the paper, vector and matrices are written in bold face. 

With regard to the errors £j and the design Xj, we make the following assumptions: 

(Al) Let / be the density of e% and F its distribution function. We suppose that /(0) > 0, 

F(0) = r, \ f(y) — /(0)| < c| 7/| 1//2 , for all y in a neighborhood of 0. The quantile r is a real number 

in the interval (0, 1). 

(A2) (Xj) is a deterministic sequence, such that n~ l Y^7=l ■^i^H converges, as n — > oo, to a non 
negative definite matrix; 
(A3) (Xj) uniformly bounded. 

These conditions are typical for a quantile regression (see e.g. Koenker, 2005). These assumptions 
are also classic conditions for a model estimated by LAD method: the first condition is found in 
Babu(1989) and the last two in Bai(1998). 

It is of interest to note that by assumption (Al) we have P[ei < 0] = r, but the expectation 
JE\ei\ cannot exist. A regression model (pQ) with the errors (ej) satisfying the condition P[ei < 0] = r 
is called quantile regression. In order to estimate the unknown regression parameter </>, we consider 
the function 



p T {r) = r[rl r>0 - (1 - r)l r < ] (2) 
and the corresponding estimator 

n 

^ = argmin Vp T (yj - X*0). (3) 

In order to study the quantile regression and the estimator ([3]), let be the processes 

Gf } (0;0°) = Pr (e t - X*(0 - 4P) - p T { £i )), ^ T) (<£;0°) = £?=i Gf\cf>;cf> ), 

Di = (1 - r)l £i < - rl £i>0 , W n = Y% =1 D i X$, (4) 

i?S T) (0; ct>°) = GS r) (0; <t>°) - AX*(0 - 0°). 

Obviously lE[Di] = 0. The relation between and F§ is 

n 

ct>°) - E[S£H4>\ 0°)] = E^ (T) ^ - ^[^(0; 0°)]] + W n {cp - 0°). (5) 

i=l 
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For the parameter regression vector <f>, we shall use the notation <f> = (</>,i, • • • , <j>,p)- 

Throughout the paper, C denotes a positives generic constant not dependent on n which may take 

different values in different formula or even in different parts of the same formula. For a vector 



v = (v±, ■ ■ ■ ,v p ) let us denote |v| = (\vi\, • • • , \v p \). On the other hand, 1 1 v 1 1 2 is the Euclidean norm 
and ||v||i = Y2i=i \ v i\ ^ s the ^1 norm - All vectors are column and v denotes the transposed of v. 
For coherence, we try to use the same notations as in the paper of Wu and Liu(2009). By elementary 



N<|x'W>-</>° 



This 



calculations, we obtain, with the probability 1, \RY , (</>; <£°)| < |X*(</> - <£°)|1 

inequality, the definition of and the assumption (A3) allow to obtain Qn\<j>; <j>°) < Cn\\«f> — 
4>°\\2- Following result proves that for every parameter (f> and for every quantile order r, the process 
g\ t \4>;4>°) has positive expectation, indifferently of the design X$. 

Proposition 2.1 Under assumption (Al), we have, for all cj> G V, JE[Gj T \(f); cf) )] > 0. 



Remark 1 In Bai(1998), the behavior in a neighborhood of <p° of the process Qn^ — JE[Qn 
obtained in the particular case r = 1/2. By a similar demonstration, we can prove that the re- 
sult holds in general, for any r G (0,1): let be a positive sequence (c n ) such that c n — > and 
nc n /\ogn — > 00. Under the assumptions (Al)-(A3), there exists a constant C > such that 



1M1 



■is 



Ve > 0, P 



su PU-4, Q \\ 2 <c n 



1 



^ r) (0;^°)-iE[^ rJ (0;0 )] 



> e 



< exp(— e 2 nc^C). 



The proof sketch of this remark is given at the end of Section 6. 
As a consequence of this Remark, for any e > 0, we have 



limsup sup 

n->oo \ ||0-4,°|| 2 < Crl 



^(gW(^0 o )-I?[g(;)(^;0 )] 
nc± \ 



a.s. 



It is well known that the estimator (|3|) has all nonzero components. For estimations and 
choosing the regressors simultaneously, penalized methods can be used: SCAD or LASSO-type. 
These estimation methods become all the more interesting for a model with K change-points 



Yi — X'^lix^j + X^l^j^ H + X t i cf) K+1 li K <j< n + £j 



,n, 



(6) 



where 1... denotes the indicator function. 

The model parameters are the regression parameters (cf) 1 , • • • , <Pk+i) an d the change-points (h, - ■ ■ , Ik)- 

The true values (unknown) are (0?, • • • , (f> K+l ), ■ ■ ■ , l° K ), respectively. 

Concern the distance between two consecutive change-points, we impose the assumption 

(A4) l r+1 -l r > n 3 / 4 , for all r = 0, 1, • • • , K. 



In order to study the properties of the penalized estimators in a model with breaking, we need 
corresponding results obtained without change-points when r = 1/2: by Ciuperca(2011b), Xu and 
Ying(2010) and for a some r G (0, 1) by Wu and Liu(2009). 
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In the next section we investigate theoretical properties of the smoothly clipped absolute deviation 
(SCAD) method in a change-point model. 



3 SCAD estimator 

We begin this section by recalling the SCAD estimator for the quantile regression model (pQ) without 
change-points, introduced by Fan and Li(2001) and developed later by Wu and Liu(2009) 



d)^'^ = arg min I 



i=i 



(7) 



The penalty p\(<fi t j) is defined by its first derivative 

p'M) = A {%, 3 |<a + (Q ( Q ~_'t)A )+ 1 l^l>4 ' ^ = ■ ■ ■ >P> ^ 

with A > 0, a > 2 deterministic tuning parameters. For real x we use the notation sgn{x) for the 
sign function sgn(x) = & when i/O and sgn(0) = 0. We also denote x + = max{0,x}. 

» (t A) 

In order to study the estimator n ' , introduce the function 

Gf A) (0; 0°) = 6^(0; 0°) + [ Pa (|0|) - P A (|0°|)]%, i = 1, ■ ■ • , n, 
with l p = (1, • • • , 1) a p x 1 vector and P\((j>) = (px((p,i), ■ ■ ■px((p,p)) also p x 1 vector, with 

= (0,1V ,0,p)- 

We first give the KKT conditions for the quantile model (HD without change-points. 

For the estimator n ' given by the relation ([7]) , let us consider the index set of the variables 
selected by the SCAD method 

with <f>n'P * ne component of 0^ ' . 



Proposition 3.1 For the estimator the KKT conditions are 



for j eA n :T Y^=l X ij ~ EiLl X H\ i<X t^ = nXs 9n(4>n,f) i 



/or j £ An ■ 



r £? =1 X ij ~ £?=i X «\<xf^' x) 



< nA. 
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For the model ©, in order to study the SCAD estimators of the regression parameters (<p 1 , • • • , 4>k+i)> 
and of the change-points (h, - • • , Ik), let us consider the function 

K + l l r 

S(h,--- ,1k) = J2 , inf , K+1 E [pr(^-X^ r )+p A;(ir _ lW (|^|)lJ. (9) 
r=1 («^i.-,0jf+i)er i=lr _ 1+1 L 

In each interval another penalty Px-,(i T _i;i r ) can be considered, with Zq = 1 an d Ik+1 = F° r 

simplicity of notation, we denote the penalty of © by P\^i r _ i; i r ) for S(l±, • • • , Ik) and by p^.^o .jo) 

for the true parameters, but it is understood that the series A are in fact \(i r _ Vi i r ), A^o .jo), 
respectively. The change-point estimator is 

(zt' A) ,--- ,ZJ' A) )= argmin S(h, ■ ■ ■ , l K ), (10) 



with the function S defined by Q. Between two consecutive change-points l r -\ and Z r , the SCAD 
estimator of the corresponding regression parameter cf) r is 

(A) ^ r ^ T 

4>{l' r -r,lr) = ar g min E - x '^r) + PA ; (z r -_ i; z I -)(l < / > rl) 1 P = argmin ^ Gf' A) (0 r ; 0°). 

^ «=i r _i+i " ^ i=i r _i+i 

* (t A) 

The following three theorems state the asymptotic behaviors of the estimators (jlOp and of 4>(il_ r j r )- 
The first result gives the convergence rate of the change-point estimator. 

Theorem 3.1 Under the assumptions (Al)-(A^), with the tuning parameter (A n ) a sequence con- 
verging to zero and for a deterministic sequence (c n ), such that c n —> 0, nc n j log n — > oo and 
^nCn 2 — > 0, as n — > oo, we have l^'^ — Z° = Ojp(l), for every r = 1, • • • , if. 

Remark 2 VFe /iGroe following relations between the sequences (A n ) and (c n ) ; A„ « « c„. 
Example of sequence (c n ): c 2 n = A n logra. 

By the Theorem 1 of Wu and Liu(2009), for tuning sequence Xn r _ lt i r ) converging to zero as 
n — > oo, we have that the convergence rate of the estimators of <f> in each segment is of order 

(Z° — Z^L-l) -1 / 2 . Hence, taking into account Theorem 13. 1} we deduce that ||0^-(' t ,a) ,-(t,ak — (firW? = 



Op (j^> - 1 



A) f(T,\) 

r-1 



-1/2 



We suppose that for each interval we have that the matrix (l r — l r -i)~ l Yli=i r 1 +i XiX* con- 
verges to C r , as n — > oo, with C r a non-negative definite matrix, which can be singular. Let us 
denote by C® the limiting matrix for the true change-points Z°, r = 1, • • • , K. We also denote by 
C® k j the (Zc,j)th component of matrix C°. 
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The following result proves that on every segment, the SCAD estimator for the regression pa- 
rameters has the oracle properties: nonzero parameters estimator on each estimated segment is 
asymptotically normal and zero parameters are shrunk directly to with a probability converging 
to 1. Let us underline that the limiting distribution not depend on the penalty p A , but only of the 
quantile order r. 

Theorem 3.2 Under the assumptions (Al)-A4), if \(i r _ lt i r ) — > 0, (l r -i — Ir) 1 ^ 2 ^(i r _ 1 .i r ) °o, a s 
n — > oo, we have 

£, 



n— >oo 



^^^(l-r)// 2 ^)^)- 1 ), where q r = Card^*^}, Q° r = (C° kj ) kJeA(i J ^ ^ is a q r x q r 
matrix. 



(ii) lim n _ > . 00 



1, With: Ayo^jo) = {j;<P r j o}, 



•^(UL^) - {* t^wj + o} - [r, $$&# M)tj + o} A r . 

We denoted by 4>^ the sub- vector of cf> containing the corresponding components of A r . Observe 
also that q r is the true number of nonzero components in rth segment. 

The next theorem establishes the asymptotic distribution of the change-point SCAD estimator. 
The limit distribution depends on the quantile order r. 

Theorem 3.3 Under the assumptions (Al)-(A4), with the tuning parameter sequence as in Theo- 
rem \3.1\ and with the condition \qo io^I® ~ ^r-i) 1 ^ 2 = we have that the estimator of the rth 

change-point (f^'^ — /J?) has the asymptotic distribution axgrnin^gg ^r]> w ^ ^ro = ^ anc ^ 



for j = 1, 2, • • • : Z?j = Etto+i [pM - X*(0 r ° - $ +1 )) - p T (si)] ; 



- for j = -1, -2, • • • : Zl r ] = Ztfi+i [p-& + X *^° ~ #+i)) " ^( e *)] • 



It is important to note that, either the condition (l r ~i — lr) 1 ^ 2 ^(i r _ 1 ,i r ) — > oo are satisfied, thus 
the oracle properties hold, or A^o 10^(1^ — ^-1) = CQ' t nus we nave the asymptotic distribution 
of change-point estimator. Cannot both be true simultaneously. 



4 LASSO-type estimator 

An important theoretical fact is that, as Zou(2006) showed recently, the oracle properties do not 
hold for the LASSO estimator. We have just seen that considering the SCAD method, the obtained 
estimators have this property. But the last method is difficult to put into practice with regard 
to numerical algorithms. Thus, Xu and Ying(2010) proposed, for model (pQ), that the tunning 
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parameter A change from one component to the other of the parameter 0. 
In this section the median model ( r = 1/2) is studied. 

First, the model ([TJ without change-points, mentioned in Section 2, is considered. The parameter 
cj> is estimate by 

4>n = argmin (V % - X\(j)\ + A* |0| ) . (11) 







Compared with the method seen in the previous section, now, the tuning parameter A n = (A nj i, • • • , X n ,p) 
is a random p-vector with different components. The fact that A ra has different components, makes 
possible that the estimator 4> n have the oracle property, obviously, choosing the components of A n 
in a judicious way. 

For the regression model ([T]) without change-points, and for the estimator <fi n given by (|11|) . consider 
the index set of estimator nonzero components = {j'i'Pnj ^ 0} where ^ ■ the jth. component 

of 4> n . Similar to the Proposition l3.lt we obtain that the KKT relations are in this case: 

" ES=i X H ■ s 9n(Yi - X$£) + A nJ • sgn(^j) = 0, for all j G A%, 



Y,tiXij-89n(Y i -X<<l> n ) 



< A„,,-, for all 3<£At, 



IL - *.L 



with A n j- the jth. component of \ n and 4>^ ■ of <p n . 
Consider now the change-point problem ([6]), with K (known) changes. For this estimation method, 
the change-point estimator is 



(!i,---Jk)= argmin V inf 



K+1 l r 

£ 

i = l r -l + l 



A* 

\Yi-X$<f» r \+ n ' {lr ~ ulr) 



r-1 



Taking into account that a particular case (r = 1/2) to the quantile regression is considered, 
following processes are introduced 



Gf /2) (0,0°)^|^-X*(^-^°)|-|e 4 |, i = l,... ,n 

ltui*)&> *°) = ^ 1/2) ^' *°) + & - ^rX^CW - i = 3i + 1. • • • .ia 



i ^ ^ = rr(i/2)/ . ,o\ , / ■ _ ■ h\* n^.i _ i-aOin .• .- , -, _■ ( 12 ) 



with < ji < j% < n and 0° the true parameter. In the particular case j\ = and j'2 = n, let us 
denote A n; ( ,„) by A n . 

Observe that, since we will study the model © with change-points, by the least absolute deviation 
method (r = 1/2) with LASSO-type penalty, the related results obtained when there is no penalty 
by Bai(1998), Ciuperca(2011b) are needed. 

Following result yields that, even if the penalty is different, this estimator has the same conver- 
gence rate as the estimator obtained by the SCAD method. 
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Theorem 4.1 If the tuning parameter A rij (; r _ 1) ; r ) satisfies the conditions ||A^« 1 ; r )||2 - ^ oo, (Z r 

/ r _i) _1 / 2 ||A^ j^||2 — > M > 0, under the assumptions (A1)-(A4), we have 1% — 1® = Ojp(l), 
/or every r = 1, • • • , -ftT. 



n— >-oo 



Combining the Theorem 14.11 and the -y/ra-consistency of the parameter estimator in a model 
without change-points (see Theorem 2 of Xu and Ying, 2010), we have that the convergence 
rate of the regression parameter LASSO-type estimator on each segment is \\4>^l ^ — <f>r\\2 = 

For this method type, the most important is to verify that if the oracle properties are preserved 
in a change-point model. The sparsity property is the most interesting and it risk to be influenced 
by the change-point estimation. We would like to point out that, due to a penalty different, the 
proof of this result differs from that for the SCAD estimator. 

Theorem 4.2 Under the assumptions (Al)-(A4), with the tuning sequence (A ni rt r _ lj ; r )) as in The- 
orem {4~l\ we have: 

(i) (tik-M ~ tf) AL = (l°r-l°r-x) l/2 ~ tf) AL A/" (0, l/(4/ 2 (0))(^)- 1 ; 

(ii) limn—^oo IP 



aL aL aL 



l, ^••^o_ i>J o ) = {i;C-^o}Mj 



{r, % fir _ ifi)J * o} and ^ [j; jfe^j * o} = noted ^ 



It is worthwhile to mention that, if the same model © is estimated by least squares, under 
certain conditions on design (Xj), with a LASSO penalty, the sparsity property (i.e. the claim (ii) 
of the Theorem 14. 2p . is not satisfied. Moreover, as the model contains change-points, this condition 
is more difficult to check on each interval that has random bounds. Then, an adaptive LASSO 
method can be considered downside to remedy this. But, it is necessary that in each segment 
(l r -i,l r ) parameter number is smaller than observation number l r -\ — l r . On the other hand, the 
adaptive LASSO for least squares method holds only under the assumptions that the errors have 
mean zero and bounded variance. 



Finally, we give the asymptotic distribution of the change-point estimator. 

Theorem 4.3 If ||A^n 1 / r )ll2 = ojp((l r — l r -i) 1 ^ 2 ), under the assumptions (A1)-(A4), we have 
(lr — 1®) — > arg mimpw zj; \- , with Z^ 1 -^ = the same process as in Theorem \3.3[ for r = 1/2. 

n— too J ' >■> 

Remark 3 It is interesting to note that the asymptotic distributions of the LASSO-type change- 
point and parameter regression estimators are the same as for the SCAD estimators, even if the 
penalty has changed. 
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Remark 4 By LASSO-type method proposed in this Section, we have simultaneously the oracle 
properties and the asymptotic distribution for the change-point estimators, when 1 i r )\h = 

op{(lr — Ir-i) 1 ^ 2 )- Recall that for the general quantile regression, by SCAD method, we have either 
one or the other. 

An example of tuning random sequence \i ) a r _ l i T )- m each segment (/ r _i,/ r ) the LAD 

estimator ^>ql_ u i r ) of <\> r is calculated by a corresponding relation to ([3]), for r = 1/2. Obtained 
estimators have all nonzero components and they have a convergence rate v n n_ x ^ \ to the true 
parameter, with (l r — l r _i)v n n _ x ; \ — > oo (see Theorem 1 of Ciuperca, 2011b). Consider then 

x ( ■ '^'i 

*n,(lr-l,lr) ~ I ?(V2) I ' ' ' ' > 7T(T72) [ ' 

5 Simulation study 

We now give some simulation results. All simulations were performed using the R language. To 
calculate Least squares estimation the function Im was used. While, for the estimations quantile, 
SCAD and LASSO-type, the function rq of the package quantreq were called. To compare these 
estimates when the classical conditions on the error distribution do not occur, we consider also the 
adaptive LASSO estimation using the function Iqa of the package Iqa and quantile estimation with 
LASSO penalty. 

The number of phases is assumed to be known: the models contain two change-points (three phases). 
We consider 10 latent variables X x , ■ ■ ■ ,X W with X 3 ~ jV(2, 1), A 4 ~ jV(4, 1), X 5 ~ jV(l, 1) and 
Xj ~ Af(0, 1) for j £ {1, 2, 6, 7, 8, 9, 10}. The true values of the regression parameters (coefficients) 
on the three segments are respectively: (1, 0, 4, 0, —3, 5, 6, 0, —1, 0), (0, 3, —4, —3, 0, 1, 2, —3, 0, 10), 
(1, 3, 4, 0, 0, 1, 0, 0, 0, 1). Three error patterns were considered: exponential, Cauchy and standard 
normal distributions. The tuning parameter A n .« r _ li / r ) is log(/ r — l r -i) ■ lp for the quantile esti- 
mation with LASSO penalty, (l r — l r -i) 3 ^ 2 for SCAD and LASSO-type estimations. For adap- 

tive LASSO method, the adaptive penalty is 1 4>(i r _ 1 ,i r )\~ 9 ^ 40 • For each model, we generated 500 
Monte-Carlo random samples of size n, with n = 60 or n = 200. The percentage of zero coeffi- 
cients incorrectly estimated to zero(true 0) and the percentage of nonzero coefficients estimated to 
zero(false 0) are computed (see Tables [TM]) by least squares(LS), quantile (QUANT), quantile with 
LASSO penalty(QLASSO), SCAD and LASSO-type methods. Since the asymptotic distribution 
of the change-points estimators can not be symmetric, in each table we also give the median of 
the change-point estimations. Because the results by the SCAD method are more poor than by 
the LASSO-type method, and also because there may be convergence problem (the function rq not 
responding), in Tables HE] the SCAD estimator is no considered. 

The outliers of the errors do not affect the precision of the change-point estimations, by six methods, 
while the sparsity property of the QLASSO and adaptive LASSO are affected. The LASSO-type 
method provides very satisfactory estimations in any case even for small sample size. The only less 
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favorable result is obtained for n = 60 when the errors are exponential. The percentage of false 
zero is large enough. 



6 Proofs of Theorems and Propositions 



In order to simplify the proofs of theorems and propositions, we give in this section their demon- 
strations and in Section 6 some lemmas and their proofs which will useful. 

Proof of Proposition 12.11 

Let us consider the notations hi ((f)) = X*(0° — (f>) and F(x) for the distribution function of Ej. By 
definition ]E[g\ t) ((f>; (f) )} = J R [p T (x + hi((f>)) - p T (x)]dF(x). Using F(Q) = t, a simple algebraic 
computation gives 

-hi(</>) 

[\hi((f))\-x]dF(x), Hhi((f))<0. (13) 



and 



JE[Gf ) ( ( f>-4P)}= f [\hi((f))\+x}dF(x), iffci(0)>O. 

J-h,,{<t>) 



Taking into account the relations (|13|) and (|14p we can write 

,,0 - hi(4>) 



(14) 



^[G^(0;0 u )]>l, t 



(0)>O 



2 



[\hi((f))\+x]dF(x) + l h . 



(4>)<o 



\hi((j>)\-x}dF(x) 







> 1 



M<t>)\ 



hi((f>)>o- 2 

^(0)1 



dF(s) + 1 M0) 



<0" 



M<t>)\ 



dF(x) 



1 



hi(4>)>0 



[F(0) - F(- 



hi((j>) 



)] + l 



M<W<0 



) - no)] 



> 0. 



Hence, JE[g\ t) (</>; 0°)] > , for alH = 1, • • • , n, </> 6 V. 







6.1 For SCAD estimator 
Proof of Proposition 13.11 

If j G A n - According to the definition ([7]), the SCAD estimator of 4> is the solution of the following 
equation = — ' a ^ — - 
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Adding these, we obtain 



i=l 



i=l 



1 |^ A) l<A + (a-l)A 1 ! 



If j .4 n - In this case E EILi 
Since p' A (|^|) = 



(r,A), 
n,j I 



A • aj^(^)l|^.|<A + {aX ~ l H-r ni ^ ] l \^\>^ [t follows that G 



9G 



(r,A),7(r,A), 



Eft i,n *_>_ri£ 



Ei=i x ij\ i<1Ll ^)\<^- 



-r^UXij + T.U X »\ <n ^ + n\ ■ [-1,1]. Then, |t£™ =1 A^- 







Proof of Theorem 13.11 

The proof is similar to that of Theorem 14.11 It is omitted. The Lemmas 17.21 and 17.51 stated in 
Section 6 are needed. <0 



Proof of Theorem (3~T21 

(i) The statement results from Theorem 13.11 together with Theorem 2(b) of Wu and Liu (2009). 



(ii) By the Theorem 2(a) of Wu and Liu(2009), we have: linin^oo IP 



l. 



The asymptotic normality of the estimators implies: for all k G A^^t.x) fr,\)y we have <f>® k 



0. It follows that 



lim IP 

n— >oo 



\(i£.?& r ' X) ) ~ A (l°r-l,l°r) 



(15) 



By similar arguments that for the Lemma 1 of Wu and Liu(2009), we prove that 



IP 



3k G {!,-•• ,p},k £Aqo _ v io r ),k G A^gfrx 



A) f(r,A) 



The Theorem results from the relations (1151) and (1161). 



(16) 





Proof of Theorem ET3l 

Consider the case j > 0, the case j < is similar. Then = Yl[ r =io + i (0r-i> Using 

the Proposition EH we have: iB[Gf ^{Li, 0°)] > 0. Then, iEfZ^] = ^[gJ t, (0°_ 1s 0°)] -> oo, 

for n — > oo. Thus, for every e > 0, there exists Mi j(E > such that: iEfargmhij zffj < Mi ;e ] > 
1 — e. By Theorem 13. 1( for all e > 0, there exists M2 e > such that for each r = ,K: 



IP[\li T,X) - Z°| < M 2 , e ] > 1 - e. Consider then M € = max{Mi j£ , M 2j£ }. For each \i r \ < M e , let 



us 
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consider the difference S(l± + i\, ■ ■ ■ , l° K + ix) — S(l^, • • • , l° K ), rewritten as 

K l°r 

r=l i=/0_ 1 +i r -i 

+*r [PA;(lO_ 1 ^_i,lo+v)(l^(S-H r -i ) J?+<r)l) ~ PaK^?) (10(1°^,*°) I) 



+ (/?-Z?-l + tr-»r-l) PA(l0(^i+ Jr _ 1 ,iO+i r )|) -Pa(I0(JJo +i) |) 



^(^■-x*^ +i) )- Pr ( £j -)]} 



lp = £>i + Pi + £> 2 -D 3 + Pa- 



Recall that the definition of the function S is given by the relation Q. For studying P2, consider 
definition © of the p' x = Al,i , <A + aA ~it' jl l A >a 7 i <aA < Under condition A ( ,o_ ,c»(Z° - 



/^j^) 1 / 2 = o(l) together the Theorem 13.11 we obtain that D\ , Pi , D3 , P2 = ojp(l). For L>2, since 



0(/o 1 +j r _ 1 zo + j r ) — <fir = Ojp(l® — x ^ 2 , together the assumption (A3), imply the conclusion. 



6.2 For LASSO-type estimator 



Proof of Theorem 14.11 

Obviously, we have 



J(t)+i 

> > min 
<^ 6r 



i=fcj_i,t+l 



J(t)+1 

E 



X! £ *l + K.;(k j - 1}t ;k j ,t 



i=fcj_i,t+l 



> -2(K + 1) sup 

l</<j<n 



inf 

4> 



E ^<U)(*.*°) 



which is, using the Lemma 17.61 — 0.p(max(n Q , A n )), with a 6 (1/2, 1). The rest of proof is similar 
to that in Ciuperca(2013), Theorem 1, using also the Lemma I7TS1 stated in Section 7. The details 
are omitted. 



Proof of Theorem 14.31 

The proof is similar to those of Theorem 2 of Ciuperca(2013). For the without penalty member 
we apply together the Theorem 14.11 (for the rate of convergence) and the Theorem 4 of Bai(1998). 
The penalty member, is of order ojp(l) by assumption [|A^ n _ x i r \\2 = op({l r — Ir-i) 1 ^ 2 ) and 
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ii£cL#) - ^°ii 2 = ® - i°r-ir 1/2 oip(i). 



o 



Proof of Theorem 14.21 

(i) The assertion follows from the Theorem 14 . 1 1 and from the Theorem 3(b) in Xu and Ying (2010). 



(ii) By Xu and Ying (2010), we have: lum^cx, iP 



A L — A L 



= 1. The asymptotic 
o XL *\ 



normality of the estimators implies that, for all k 6 A L u<n< • ■ , ,-, 

i;(ir-l> J r) ' «;('r-l>v)i fc rwoo 



0. Thus limn^oo JP 



4 L C 4 L 



we have 

1. The proof is finished if we show the claim 



IP 



3k €{!,■■■ ,p},k gA (l a _^o y k € ^.^jl) 



that, with the probability 1, 



K), for n — > oo. Since G „4 L , fr !r , we have 



(17) 



We suppose, without loss of generality, that sgn(4>^ L fL ) = 1. Then, using the KKT conditions, 
we have with the probability 1, 



i=£_l+l 



(18) 



On the other hand, since k g" A^ l0 y we have (jP rk = 0, then sgn((jP rk ) = 0. By the proof of the 
Proposition 4 in the paper of Xu and Ying(2010), for <fP k = 0, we have that for every cf> such that 
U,k\\ < Cn-V 2 , 



sgn 



^X ik ■ sgn(Yi - X-0) + \ n , k ■ sgn{(j) >k ) = sgn(<j> >k ) 



8=1 



1. 



Then, taking into account the assertion (i), we apply the previous relation for <j)h L - 



lim IP 

n— >oo 



sgn 



i=l 



1, 



where (f>f~ L - L is the kth component of the random vector ($>(Tl Moreover, by (fl~8 



( - E"=i -X<* • s5n(Yj - X*0 ( fi_ i; ^)) + KtfL_jL )tk ■ sgn{<j)^ j^ ) k) )\ is 0, with the probabil- 
ity 1. Then lim ri _ Sl00 iP [0 = sgn{<j) k )] = 1. Contradiction with (|17p . Thus the claim holds. 
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The demonstration of the Remark[T]is similar to that of Bai(1998). Then we give only the main 
idea. 

Proof of Remark [TJ 

Similar as for the proof of the Lemma 17.14 we obtain, for — 2 II2 < c n n -1 / 2 , 



ncl 



cS \l nc r , 



which converges to for n — > 00. We also have iPfsup^ \n 1 c n 2 [Qn\cj)j; 4>°) — ]E[Qn\(t>j; </>°)]]| > 
e] < Ej PWP^f 4>°) " JE[G { n\^j\ <j>°)]]\ > nc 2 n e], where j = 1, • • • ,rW 2 . By the relations ® 
and (EH), we have IC^C^; 0°) - 1E[g[ t ) <f>°)}\ < C|X<(0 - 0°)| < Cc„. The rest of proof is 
similar to that of the Lemma 4 of Bai(1998). 



7 Lemmas 

We present in this section the lemmas with proofs, which are useful to prove the main results. 

(t) 

Following Lemma gives the asymptotic behavior of the objective function G\ without penalty. In 
fact, Lemma 17. II will be necessary to prove the Lemmas 17.21 and 17.61 where the penalized objective 
functions are studied. 

Lemma 7.1 Under the assumptions (Al), (A3), for all a > 1/2, we have 
supi<z <fc < n I inf^ 6r J2i=i g[ t \4>; <t>°)\ = P {n a ). 
Proof of Lemma 17.11 

By direct calculations i?f \(f> 1 ; cf>°) - #J r) (0 2 ; 0°) 

can be written as 

{Xf(0! - 2 )[(1 - r)ll £i <o - Tl ei>0 ])}+{h - XK0i - O )][rl £ . >x|( ^_^ O) - (1 - rl^x*^^ 

-{[si- X*(0 2 - 0°)][rl e . >x t ( ^_^o ) - (1 - Tl £ ^l i4>2 -4,°))]} = Sx,i + S 2 ,i - S 3 ,i. (19) 
Obviously Si^ = ~K t i (<f> 1 — <p 2 )Di. For S 2> i — S$ } i we have: 

- If Si > X*(0! - 0°). When £j > X-(<£ 2 - 0°), we have S 2) j - S^j = rX-(0 2 - In the case 
e % < X*(0 2 - <t>°), we have % - 5 3>i = rXf(0 2 - + -'xj(0 2 ' - 0°)] < rX*(0 2 - X ). Then, 
in the both cases, S^.i — 5^ < rX*(</> 2 — 0i)- 

-Ifei < X*^-^ ).' Whene* < X*(0 2 -0°), we have S 2 ,i-S 3 ,i = (1 - r)X*(0 1 - 2 ). In the case 
ei >Xf(0 2 -0 o ),wehave5 2 , i -5 3ii = (l-T)[XK0 1 -0 2 )'+ei-'x*(0i-0 )] < (l-rjXj^-^). 
Then, in the both cases, 5 2; j — S^j < (1 — t)X*(0 1 — 2 ). 
In conclusion, with the probability 1, 

S X ,i + S %t - % < XK0i - a )A + max(rXf(0 2 - <t> x ), (1 - r)X^(0i - <fc)) (20) 
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Similarly 

Si,i + S 2 ,i - S 3:i > X*(<fo - 2 )A + min(rX*(^ 2 - 0^, (1 - r)X*(0! - 2 )) (21) 

Hence, the relations (p79|) . (f20|) and (|2T|) . for — 2 ||2 < CrT x l 2 together the assumption (A3), 
imply that 



£?=i [^ T) (0i; 0°) - ^ (T) (0 2 ; 0°) - msPfa )] + ^[^(02; 0°)] 

< CEr=i H X ill2 • 1102 - 0ill2 < Ojpin 1 / 2 ). 
By an argument similar to the one used in Bai(1998), Lemma 3, we obtain 



(22) 



sup 

KKk<n 



inf£G«(0;0°) 

i=i 



< 2 sup sup 

l<fc<n <j> 



£[gS t) (0;0 o )-^[G 4 M (0;0°)]] 



i=l 



On the other hand 

£[G< T } (0; 0°) - «[GW(0; 0°)]] = £[^(0; 0°) - iE[i^(0; 0°)]] + £ D ^ ~ 

i=l i=l i=l 



Let us consider the random process = sup^ ^2i = i[G[ T \4>] 4>°) — lElG^'^ ^(0;0 )]] 

since by Proposition 12.11 1E[G^ (d>; 0°)] > 0, follows that J r fc} fc=1 ... n is a sub-martingale, 
where T\~ = o — field{ei, ■ ■ ■ , which implies, using Doob's inequality JP[sup 1<fc<n ^ > re a ] < 
n~ am G TO iE[£™], C m > 0, with m > 1. We divide the parameter set T into m p / 2 cells, such that 
the cell diameter is < n~ l l 2 . Thus 



Then, 



71 

£ [gS t) (0 i; 0°) - iE[Gf >(0 i; 0°)] - G^te,; 0°) + I?[Gf 0°)] 



i=i 



< 



£ [flWfo; 0°) - JK[i?«(0 i; 0°)] - i^(0 2 ; 0°) + } (0 2 , 0°)] 

i=l 



+ 



^AX*(0!"0 2 ) 



i=l 



and using the relation (|22p. we obtain, with the probability 1, that the last relation is smaller than 
Y^i = \ |X*(0 2 — 0i)| < Cnn~ 1 / 2 = Cn 1 / 2 . By an argument similar to the one used in Bai(1998), 



££=1 [ G ! r) (0r5 0°) " Or! 0°)]] < Gn m / 2 . The rest of proof is similar to that of 



we have IE 



the Lemma 3 of Bai(1998). 
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7.1 For SCAD estimator 



Following result will be useful in the study of the convergence rate of the change-point SCAD 
estimator in a model with breaking. 

Lemma 7.2 Under the assumptions (Al), (A3), for a positive sequence (A n ) n such that A n — > 0, 
we have 



sup 

0<jl<32<n 



J 2 



inf £ G^W] 

1=31+1 



Op(n a ,nX n ). 



Proof of Lemma 17.21 

Using the triangle inequality, we deduce that 



sup 



32 



inf £ g^W; 

«=jl+l 



< sup 

0<jl<j2<™ 



J 2 



inf £ Gf (0;<^ 



+nsup|(p A (|0|)-p A (|0°|))*l 



Considering Lemma 17,11 and the definition of p\, we have that the last relation is smaller than 
P (n a ) + nX n . * 

In the following Lemma, the behavior of Qn^ is studied in the outside of the ball center f3° and 
radius c n . 

Lemma 7.3 Under the assumptions (Al), (A2), with (c n ) a positive sequence such that c n — > 
and nc n /\ogn — > oo, there exists e > such that 



lim inf 

n— >oo 



( inf > e>0 . 



Proof of Lemma 17.31 

Let u in an open subset of W . By the proof of the Lemma 3 of Wu and Liu(2009), taking into 
account the assumptions (Al) and (A2), we have 



n 



M 

2n 



u'(^X,X*)u + 0iP (l). 



i=l 



If c n — > and nc 2 n — > oo, we 



ojp(l). The function G[ T \dr,4>°) is convex, hence Gn'i&rf^) is convex in cf>. Thus, its minimum 
over ||</> — </>°||2 > c n is realized for ||0 — <£°||2 = c n . Then, for 1 1 u 1 1 2 = 1, using the assumption (A2) 



have similarly IE \Qn^ ((f>° + uc «j 0°) 



z f i 4u*(Er=iX i x*)u + 



we obtain that IE 



Qn^ (<P° + uc n] 4>°) = ^y^ nc n(l + °2p(1))- The rest of proof follows using the 



the Lemma 5 of Bai(1998). 
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Lemma 7.4 Under the assumptions (Al), (A2), for two positive sequences (c n ) and (A n ) such that 
X n — > 0, c n — > 0, nc n /\ogn — > oo and \ n Cn 2 ~~ ^ 0> we have 



liminf | inf 

n— >oo 



JI^-</-°l|2>Cn nc 

Proof of Lemma 17.41 

Applying the mean value theorem, we write G[ T ' X \<f); </>°) = G\ T \cf); <p°) + [\<p\ — |0°|]*p' A (<^), with 
= 0°- h{4> - d>°), b G [0, If. Then, using the relation ®, we have 

mf — 2 £ G^\d>; 0°) > inf -W>(* 0°) - 

ll</>-0°l|2>c„ nq* \\4>-4>°h>c n nc l n nci 

Since A„c~ 2 — > 0, for every e > there exists a ra e G N such that A n c~ 2 < e/2. An application of 
Lemma EH leads to inf || ^ 1 1 2 > Ci ^ ^rGn\<f>', 0°) > if and Lemma is proved. 

By similar calculus as in Bai(1998), Lemma 10, we have following result for the estimator ([3]) 
of cf). 

Lemma 7.5 For (X n ), (c ra ) as in the Lemma \l , under assumptions (A1)-(A3), for allni,ri2 £ N 
such that n\ > n u , with 3/4 < u < 1, n% < ri" , v < 1/4, let us consider the model 

Y i = y,\ctP 1 + E i , i = i,---, ni 

Yi = X|02 + £%, * = ni + 1, • • • ,ni + n 2 

with the assumption <\>\ + <\>\. Consider A^ n2 (<f>) = ZZl G^Hfr <t>i) + E?i£+i ' A) (0; 4%) 

and 4>nl+ n2 = argmin^£ A) n2 (0). 

(i) WtnSL ~ <t>% < n x lt2 n^ < n -(— 5 )/ 2 . 

(nJYZiG^ (^;?„ 2 ;^)=O jp (1). 

Proof of Lemma 17.51 

(i) A^ n2 (Vn" +n2 ) < Y&i G^i^; 4>l) = ZZtll CS T) (0?; ^)+n 2 [ PA (|0?|)-p A (|^|)]l P = 
ojp(1) + 0(n 2 ). By Lemma E2 for G\ T ' X \ i = m + 1, • • • , n 2 , we arrive to a contradiction. 
(»; Let z£\<p) Zti G { [\<t>; <t>°i), 4 r) (0) = ES+i - *i(<t> ~ 4%)) ~ pM - Xf(0° - tfjj))] , 

4"' A) (^) = 4" ) (0) + n 2 [p A (| <? i|)-p A (|0?|)]l p , ^' A) (0) = ^ ) (0) + n 1 [p A (|^|)-p A (|^?|)]V Then 

ni+n2 

A^(<f>) = Z^ A )(0)+^ A )(0)+n 2 [p A (|0?|)-p A (|^|)]l p + £ [M^ - X*(0? - $)) - pM)] . 

i=m+l 
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We have |4 r ' A) (^ni+n 2 )l < \tn\^+ n2 )\ + n 2 \\p x (\^+n 2 



Pa (1 0? I) 111 an d similarly that for the 

relation © < C^Ztli I^MI^+l, " 0il|2 + ^IMI^+lj) - Pa (|0?|)||i. The rest of proof 
is similar to that of the Lemma 3(ii) of Ciuperca(2013), taking into account the assumption (A3). 



We have the equivalent of Lemma 4 of the same paper. 



7.2 For LASSO-type estimator 

Lemma 7.6 Under the assumptions (Al), (A3), we have, for a > 1/2, 
h 



sup 

0<Ji<.?2<" 



Ojp(max(n a , sup \\K;(ji,j2)h))- 

0<ji<j2<n 



Proof of Lemma 17.61 

By the Lemma 3 of Bai(1998), Lemma holds for G^ 2 ^ instead of 7 li-^ 1 j 2 )- For 77^, we have 

K(h,h)&> 0°) -^ (1/2) (0; 0°)| = (ii - Ji)- 1/2 |A*. o - lA) (|0| - |0°|)|. Then, by triangular inequality 
together the Lemma 17.11 and the compactness of the set V 



sup 

0<jl<j2<n 



.12 



i=j\+l 



< sup 

0<jl<j2<n 



.12 



inf ^ gS 1/2) (0;0°) 

1=51+1 



+ C SUp ll^n;(iij 2 )lll 
0<71<J2<1 



JP (n a ) + JP (A n ). 







Lemma 7.7 Under the assumptions (Al), (A2), if n 1 ^ 2 \\\ n \\2 — > Ao, wii/i Ao > 0, i/ien 

liminff inf n~ l V ,(0; 0°)j > e. 
Proof of Lemma 17.71 

Using (A2), by the Lemma 6 of Ciuperca(2011b), we have for G^ 2 \ with the probability 1 



lim inf 

n— ¥oo 



inf n" 1 V G< 1/2 W°) ] >|. (23) 



8=1 

We also have the inequality 



o inf ^E^W^ )^,, o inf 1/2 E^ (1/2) ^ ; ^- sup A n(i0i-i0°i)- 
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(24) 

Since ||A n ||2 = Ojp(n _1//2 ) and </> belongs to a compact set, then the last term of the right-hand-side 
of ([Ml) is ojp(n _1 ). Hence 

sup (n^A'JM - |0°|)) < e -, n>n e . (25) 
The conclusion follows, combining the relations (|23p . ()24|) and ()25|) . 



Lemma 7.8 For all n\,ni 6 N swc/i i/iai n\ > n u , with 3/4 < u < 1, n-z < n v , v < 1/4, let us 
consider the model 



Y i = X b i 4>l + e i , i = l,---,m 

3*8 



with the assumption cf)\ ^ eft®. Under the assumptions (A1)-(A3) and (A n ) as in the Lemma 

112 



let us consider < +n2 (0) = £^=1 V^ , ni )(^ *?) + ffi+l ^.nx+n.)^ ^ and 



argmin ^ i+n2 (0). ITien 



rH;E-ii^ (O)m) (^ 1+ n 2) 0i) = Oip(i)- 

Proof of Lemma 17.81 

We denote <£ ni+n2 = argmin^ £^ G< (1/2) (c£, 0?) + £^++l Gf /2) (0, 4>D- Using the assumptions 
(Al) and (A3), by Lemma 10 of Bai(1998) we have that (i) and (ii) are true for ^ ni+n2 and . 
(i) We suppose the contrary ||0 ni -|_ n2 — 0?||2 — n i- On the other hand, we have by definition: 

m+n2 ni+7i2 
< +n2 (0n 1+n2 )< £ n^m+nM^ = £ Gp)(0?,0°)+AUl0?|-|0°|). (26) 

j=ni+l i=ni+l 

By Lemma lO(ii) of Bai(1998), we have: E?=£+i Gf /2) (</>?, 4>l) = op (I), then taking into account 
the relation (|26p . we obtain 



A L ni+n2 {4> ni+n2 ) < 0JP (1) + Ojpin 1 / 2 ). (27) 
On the other hand, using the Lemma [77TI we deduce 



E4m)(^ 1+ n 2 il) > Oj.(ni). (28) 
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There is a contradiction between the relations (|2T[) and ()28|) . 

(ii) Introduce ^ 2 )(Xj) = X*(0 1 — 2 ). For r = 1/2, let us recall the notations given in Lemma 



,(1/2) 



(1/2), 



By the Lemma 7 of Ciuperca(2011b), we have that: Zn^ 2 \cj) ni+n2 ,cf) [ {) = Op(l). Introduce now 



\<t>,4>l) 

l0\ 



t£(0) = 4 1/2) (0)+AL ni+n J|0|-|0°|], Z£(0) ee ^(0)+A| O)ni) [|0|-|0°|]. Thus < +n2 (0) 



(1/2), 



4 1/2j (0)+A; Oini) [|^i-i0?|]+4 1/ ^(^)+E^r+i 
= ^(0) + ^(0) + A; ni)m+n2) [|0?i - 1^|] - 

argmin ^ 1+n2 (0) = argmin^[^(0)+^(0)]. But |t£(0„ 1+n2 )| < |tk 1/aj (^ 1+na )|+|Aj >ni+na) [|^ 



•Nrai+712 
i + l 



(o,m)L 



+Aj ni!m+n2) [|0|-|0 
• Then 0„ 1+n2 



|0i|]| and using the elementary inequality ||a| — 16|| < \a— b\, we have |£^(0 ni 



+n,2i 



— Z_(j=ni+1 



L 

< t > m+n 2 ' 



-712 I 



01 111 • ll x illi + |A( ni)r j 1+n2 )[|0n x +n 2 l ~ I0il]l ^ °p{ 1 )i we have used W and the assumptions (A3) 
and ||A( ni)m+n2 )||2 = o P (nl /2 ). 

We have also Z£(0?) = i£(0°). Thus > inf^(0) + ^(0)) = Z^ 1+n2 ) + *n(0n 1+ n 2 ) = 

■^n (0ni+n 2 ) — I°Jp(1)I — m ^ (0) ~~ I °Jp(1) I • But inf <^ Z n (0) = (3/p(l) . The rest of proof is similar 
to that of the Lemma 3(h) of Ciuperca(2011a). 
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Table 1: Median of change-point estimations, percentage of true and of false by LS, QUANT, 
QLASSO, SCAD, LASSO-type and adaptive LASSO methods for n = 200, K = 2, Z° = 30, 
Z° = 100, Ei ~ £xp(-1.5). 



Method 


LS 


QUANT 


QLASSO 


SCAD 


LASSO-type 


aLASSO 


median of (Zi, Z2) 


(31,100) 


(31,100) 


(31,100) 


(30,100) 


(30,100) 


(30,100) 


% of trues 








46 


75 


97 


94 


% of false 








1 


3 


3 


8 



Table 2: Median of change-point estimations, percentage of true and of false by LS, QUANT, 
QLASSO, SCAD, LASSO-type and adaptive LASSO methods for n = 200, K = 2, l\ = 30, 
1% = 100, Ei ~ Af(0, 1). 



Method 


LS 


QUANT 


QLASSO 


SCAD 


LASSO-type 


aLASSO 


median of I2) 


(31,100) 


(30,100) 


(30,100) 


(30,100) 


(30,100) 


(30,100) 


% of trues 








37 


65 


98 


94 


% of false 








0.5 


5 


2 


8 



Table 3: Median of change-point estimations, percentage of true and of false by LS, QUANT, 
QLASSO, SCAD, LASSO-type and adaptive LASSO methods for n = 200, K = 2, = 30, 
I® = 100, Ei ~ Cauchy. 



Method 


LS 


QUANT 


QLASSO 


SCAD 


LASSO-type 


aLASSO 


median of (lijfo) 


(31,100) 


(30.5,100) 


(30,100) 


(30,100) 


(30,100) 


(30,100) 


% of trues 








36 


62 


95 


48 


% of false 








1 


3 


3 


12 
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Table 4: Median of change-point estimations, percentage of true and of false by LS, QUANT, 
QLASSO, LASSO-type and adaptive LASSO methods for n = 60, K = 2, /? = 17, Z§ = 40, 
Si ~ S xp(— 1.5). 



Method 


LS 


QUANT 


QLASSO 


LASSO-type 


aLASSO 


median of(Zi, Z2) 


(18,40) 


(18,40) 


(18,40) 


(18,40) 


(17,40) 


% of trues 








60 


91 


75 


% of false 








27 


27 


17 



Table 5: Median of change-point estimations, percentage of true and of false by LS, QUANT, 
QLASSO, LASSO-type and adaptive LASSO methods for n = 60, K = 2, = 17, f 2 = 40, 
e,~AA(0,l). 



Method 


LS 


QUANT 


QLASSO 


LASSO-type 


aLASSO 


median of(Zi, I2) 


(18,40) 


(18,40) 


(18,40) 


(17,40) 


(17,40) 


% of trues 








51 


92.5 


82 


% of false 








7 


13 


15 



Table 6: Median of change-point estimations, percentage of true and of false by LS, QUANT, 
QLASSO, LASSO-type and adaptive LASSO methods for n = 60, K = 2, = 17, 1% = 40, 
£i ~ Cauchy. 



Method 


LS 


QUANT 


QLASSO 


LASSO-type 


aLASSO 


median of(Zi, I2) 


(17,40) 


(18,40) 


(18,40) 


(17,40) 


(17,40) 


% of trues 








48.5 


82 


43 


% of false 








18 


26 


16 
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